Information collection system for a communication network with language translation capabilities

ABSTRACT

An information collection system comprises a user interface unit for executing input/output information with respect to a user, an external interface unit for exchanging various types of information with an external unit, a user model determination unit for preparing at least one of user information for discriminating information required by the user, user information for defining an information proposition method, and user information for defining an information modification method in accordance with information input from the user interface unit, an information drawing-out unit for drawing out and modifying information input from the external interface unit in accordance with the user information acquired from the user model determination unit, an information proposition processing unit for converting information acquired by the information drawing-out unit into a proposition form for the user in accordance with the information acquired from the user model determination unit, and a control unit for receiving information from the individual units mentioned above, and executing control of the individual units and information exchange therewith in accordance with the received information.

This is a continuation of application Ser. No. 07/762,373, filed on Sep. 19, 1991, now U.S. Pat. No. 5,673,428.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to an information collecting system capable of collecting (i.e., drawing out or extracting and processing) only that information among all the information available through an information network or the like which a user requests in a form desired by the user at a user-desired timing to thereby propose or output only the information requested by the user.

2. Description of the Related Art

Recently, as the information processing technology using an electronic computer and the technology for information communication via various communication networks have improved, multifarious data are accumulated as information and the accumulated information can be referred to from various locations. As so-called personal computer communication systems and electronic mail services become popular, it is possible to acquire a vast amount of a very wide variety of information.

According to a conventional way of collecting information using a computer, a user accesses a database using a command language (e.g., key-word searching etc.) to extract the desired information from the database. The format of information present in an information source, such as a database, and the amount of that information often differ from what is desired by the user. It is therefore considerably difficult to extract only the desired information for information collection. Unless the user accesses the information source, the user cannot acquire information therefrom. When the user neglects accessing the information source, he is likely to undesirably overlook useful information present in the information source. When the user constantly accesses every available information source to obtain information, the desired information is easily buried in a huge amount of multifarious information accessible from the information sources. This conventional method easily causes the user to overlook the desired information.

As described above, conventionally, every time a user wants to collect information, the user should access available information sources. Further, when the user neglects accessing information, the user may easily overlook useful information if such exists. Of all the information accumulated in a database or the like designed for non-individual use, there is often very little information a user wants. The conventional information collection therefore undesirably requires the tedious work of:

1) screening or filtering excess information from what is obtainable from an information source, and

2) extracting only the desired information from the available information and processing it.

SUMMARY OF THE INVENTION

An object of the present invention is to provide an information collection system, which always monitors information available through a computer network or the like, and, upon finding information preset and requested by a user, can accurately collect only that user-requested information by subjecting the information to processing, such as summarizing or drawing out, and proposing the processed information in a form and quantity desired by the user at a user-desired timing.

An information collection system according to the present invention comprises:

a user interface unit for executing input/output information with respect to a user;

an external interface unit for exchanging various types of information with an external unit;

a user model determination unit for preparing at least one of user information for discriminating information required by the user, user information for defining an information proposition method, and user information for defining an information modification method in accordance with information input from the user interface unit;

an information drawing-out unit for drawing out and modifying information input from the external interface unit in accordance with the user information acquired from the user model determination unit;

an information proposition processing unit for converting information acquired by the information drawing-out unit into a proposition form for the user in accordance with the information acquired from the user model determination unit; and

a control unit for receiving information from the individual units mentioned above, and executing control of the individual units and information exchange therewith in accordance with the received information.

This information collection system is characterized in that the information proposition processing unit includes a function to integrate multiple pieces of information acquired through the external interface unit respectively from multiple different information sources and convert the integrated information into a proposing form for a user, and a function to change a proposition method for information acquired through the external interface unit and a proposing amount of the information in accordance with an elapsing time from an input time of the information or an amount of information drawn out and modified by the information drawing-out unit.

In addition, this system is characterized in that the control unit includes means for accessing an external database via the external interface unit in accordance with the user information proposed by the user model determination unit, and a function to propose and output the information converted by the information proposition processing unit through the user interface unit at a timing determined in accordance with the user information proposed by the user model determination unit.

According to the information collection system embodying the present invention, first, a user invokes the user model determination unit while interacting with the system through the user interface unit. Then, the user causes the user model determination unit to prepare user information including a condition concerning the desired information, an information proposition method, modifications to be added to the desired information, and the like. Finally, the user stores the prepared user information into the data storage unit, or corrects the user information already determined by the user model determination unit.

The control unit properly accesses the information source specified in advance in accordance with the user information set by the user. The control unit properly fetches information flowing in an external network via the external interface unit, and gives the fetched information to the information drawing-out unit. The information drawing-out unit analyzes the information and determines whether it is necessary or unnecessary in accordance with the user information. If the fetched information is the desired one, the information drawing-out unit performs user-desired processing, such as data modification, and accumulates the resultant information into the data storage unit.

Thereafter, the control unit invokes the information proposition processing unit whenever needed based on the user information. The information proposition processing unit performs integration or conversion of various types of information obtained by the information drawing-out unit in accordance with the user information, when needed. The control unit proposes the integrated or converted information from the information proposition processing unit to the user via the user interface unit.

The thus structured information collection system embodying the present invention can permit a user to properly acquire only the desired information among a vast amount of multifarious information in the desired proposition form, thereby ensuring easy and timely information collection without overlooking information.

The information collection system according to the present invention performs the following operations:

1) Permits a user to enter an information source from which information is to be collected, a condition concerning the desired information, a proposition method for that information, modifications to be added to the information, and the like.

2) Automatically accesses information via the external interface unit or monitors automatically received information, and selects only the user-desired information among the accessed or received information to thereby collect the information.

3) Modifies the collected information, if necessary.

According to the present system, therefore, the necessary information can be proposed and output in the user-requested form. This produces practically significant effects including:

1) The information collection can be executed efficiently and without overlooking any information.

2) Accurate information collection can be conducted effectively.

Additional objects and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objects and advantages of the invention may be realized and obtained by means of the instrumentalities and combinations particularly pointed out in the appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of the specification, illustrate a presently preferred embodiment of the invention and, together with the general description given above and the detailed description of the preferred embodiment given below, serve to explain the principles of the invention.

FIG. 1 is a diagram illustrating the general structure of an information collection system according to one embodiment of the present invention;

FIG. 2 is a flowchart showing the operation of a user model determination unit;

FIG. 3 is a diagram illustrating the contents of a user information unit;

FIG. 4 is a flowchart representing identification processing done by the user information unit;

FIG. 5 is a diagram illustrating a screen menu for identification of the user information unit;

FIG. 6 is a diagram showing a screen to determine an information source as a search object;

FIG. 7 is a diagram illustrating the data structure of information sources;

FIG. 8 is a diagram showing a screen for determining search conditions;

FIG. 9 is a flowchart for acquiring a key word group from a single key word;

FIG. 10 is a diagram exemplifying a key word thesaurus;

FIG. 11 is a flowchart showing how to describe the conditions of contents;

FIG. 12 is a diagram illustrating how data is transferred in processing the contents conditions;

FIG. 13 is a diagram showing a screen for determining proposition contents;

FIG. 14 is a diagram of a screen to determine a proposition method, a data processing and a data modification;

FIGS. 15A and 15B are flowcharts representing the processing of a control unit;

FIG. 16 is a flowchart representing the processing of an information drawing-out unit;

FIG. 17 is a diagram illustrating a semantic structure of a sample sentence;

FIGS. 18A and 18B are flowcharts showing the operation of the information draw-out unit;

FIG. 19 is a diagram exemplifying the relation between information stored in a data storage unit and the user information units;

FIGS. 20A and 20B are diagrams of sample data generally defined as a proposition method;

FIG. 21 is a flowchart showing a summarizing process;

FIGS. 22A and 22B are diagrams exemplifying the user information units;

FIG. 23 is a diagram illustrating an example of information acquired by a patent database;

FIG. 24 is a diagram illustrating an example of information acquired by a telex line; and

FIG. 25 is a diagram exemplifying the proposition contents of the system.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

An information collecting system according to one embodiment of the present invention will now be described referring to the accompanying drawings.

FIG. 1 illustrates the general schematic structure of the system according to this embodiment.

The system includes a user interface unit 1, an external interface unit 2, a data storage unit 3, a user model determination unit 4, an information drawing-out unit 5, an information proposition processing unit 6 and a control unit 7.

The user interface unit 1, which includes a keyboard, a mouse, etc., has the functions of receiving information sent from a user, displaying information and outputting (or printing) information on paper. The external interface unit 2, connected to an external database or a communication line, performs transmission and reception of external data.

As an example of external information sources, a patent database and a telex information line will be described in the system of this embodiment. The two information sources are to be accessed through the external interface unit 2.

The data storage unit 3 stores, calls and corrects (updates) user information to be described later and information collected via the external interface unit 2, both necessary for the processing in this system. Through the interactive communication with the user via the user interface unit 1, the user model determination unit 4 determines the contents of processing to be added to information (for example, (1) user information for discriminating which information the user requires and which the user does not among received information, (2) user information needed to specify the form of information proposition to the user and the proposition timing, and (3) information requested from the outside of the system), and also prepares user information to specify how to process the contents of the processing. Based on the user information prepared by the user model determination unit 4, the information drawing-out unit 5 discriminates, draws out and modifies various information obtained via the external interface unit 2. The information proposition processing unit 6 arranges, integrates and edits various information having undergone the discrimination, drawing-out, modification, etc. in the information drawing-out unit 5, in accordance with the user information. The processed information is then proposed to the user via the user interface unit 1. The control unit 7 controls the above-described individual units.

The functions of the individual units in this system will now be explained in detail.

FIG. 2 shows the general operation of the user model determination unit 4. The user model determination unit 4 interactively communicates with the user via the user interface unit 1 and prepares user information, identifies the user information with that in the data storage unit 3, and erases and edits existing user information stored in the data storage unit 3.

The specific processing of the user model determination unit 4 to determine a user model will be described below.

The user model determination unit 4 searches user information previously stored in the data storage unit 3 and displays the searched user information via the user interface unit 1 (step A1). The user model determination unit 4 receives a command from the user to the displayed information, and analyzes it (step A2). In the case of receiving an END! command from the user in step A2, the user model determination unit 4 follows that command, stores the user information unit (to be described later) at the time of reception of the command, and terminates the processing.

The user can also select three other commands, ERASE!, EDIT! and IDENTIFY!, in step A2.

When receiving the ERASE! command from the user interface unit 1, the user model determination unit 4 assigns the user information unit (or a user model) to be erased via the user interface unit 1 (step A3), and erases that user information unit (step A4). After the user information is erased in step A4, the user model determination unit 4 does not collect information about the contents of the erased information.

If the user has input the EDIT! command, the user model determination unit 4 performs a user model edit processing to the user information unit (step A5). When the IDENTIFY! command is input, the user model determination unit 4 performs a user model identification processing (step A6).
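The command handling of steps A2 to A6 can be pictured as a simple dispatch. The following Python sketch is illustrative only: the `UserModelStore` class, the `handle_command` function and the dictionary-based storage are hypothetical names and choices, not taken from the embodiment, and the edit and identification processing are reduced to plain dictionary updates.

```python
from dataclasses import dataclass, field


@dataclass
class UserModelStore:
    """Hypothetical stand-in for user information units held in the data storage unit 3."""
    units: dict = field(default_factory=dict)  # unit name -> settings dictionary


def handle_command(store, command, name=None, settings=None):
    """Dispatch one command analyzed in step A2.

    Returns True when processing should terminate (END), otherwise False.
    """
    if command == "END":
        return True
    if command == "ERASE":        # steps A3 and A4: pick a unit and erase it
        store.units.pop(name, None)
    elif command == "EDIT":       # step A5: edit an existing unit
        store.units[name].update(settings or {})
    elif command == "IDENTIFY":   # step A6: register a new unit
        store.units[name] = dict(settings or {})
    else:
        raise ValueError(f"unknown command: {command}")
    return False


store = UserModelStore()
handle_command(store, "IDENTIFY", "nlp", {"period": "Monday, 10:00"})
handle_command(store, "EDIT", "nlp", {"proposition method": "EMAIL"})
```

In this sketch an erased unit simply disappears from the store, which mirrors the statement that no further information is collected for it.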

The user information unit defines a group of various user information. If the user sets the user information unit in a preferred form, he can obtain the necessary information in the desired form and at the desired timing. The data storage unit 3 in the system according to this embodiment can store multiple user information units.

FIG. 3 shows an example of the items that the user information unit has. Information of the individual items is basically set via the user interface unit 1 in the user model identification processing (step A6).

A search object a is user information for specifying an information source from which information is to be collected. Based on user information indicated by the search object a, information sources (such as database, information, information base station and news, all accessible via the external interface unit 2 of the system) are set to the individual user information units. Multiple information sources can be set in a single user information unit. It is possible to assign, for example, multiple information sources, such as A newspaper and B newspaper, or assign an information group (which includes multiple information sources, for example, the A newspaper and B newspaper, in the case of assigning newspapers). A search condition b defines user information to discriminate information the user desires from unnecessary information. According to the search condition b, a matching process is performed between various information supplied from the information sources and the user-requesting information.

Proposition contents c are user information for determining which part of information (one part or the entire part) acquired through the matching process to be described later should be proposed to the user. A proposition method d is user information of when and in what form the acquired information should be proposed to the user. A data proceeding e is used to determine how the acquired information is to be saved or erased. A data modification f indicates a method for modification when the information should be modified.
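As a rough illustration of the six items a to f of FIG. 3, the user information unit can be modeled as a record type. This is a minimal sketch, not the embodiment's actual data layout: the class name, field names and field types are assumptions chosen for readability, and the sample values are drawn from examples that appear elsewhere in this description.

```python
from dataclasses import dataclass, field, asdict


@dataclass
class UserInformationUnit:
    """Hypothetical record mirroring items a to f of FIG. 3."""
    search_object: list = field(default_factory=list)         # a: information sources
    search_condition: dict = field(default_factory=dict)      # b: key words, contents condition
    proposition_contents: dict = field(default_factory=dict)  # c: which parts to propose
    proposition_method: dict = field(default_factory=dict)    # d: when and in what form
    data_proceeding: dict = field(default_factory=dict)       # e: saving or erasing
    data_modification: list = field(default_factory=list)     # f: modifications to apply


unit = UserInformationUnit(
    search_object=["Japanese patent database", "telex"],
    search_condition={"key word": "natural language processing"},
    proposition_method={"period": "Monday, 10:00", "form": "EMAIL"},
    data_proceeding={"erase timing": "one week after proposition"},
    data_modification=["translation to English", "output to paper"],
)
print(asdict(unit))
```

Because the data storage unit 3 can hold multiple user information units, a list or dictionary of such records would be a natural container in this sketch.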

FIG. 4 illustrates an identification processing conducted by the user model determination unit 4 to set the user information shown in FIG. 3.

The processing flow shown in FIG. 4 will now be described. First, information of an item to be identified by the user interface unit 1 is proposed to the user, for example, on a user screen as shown in FIG. 5 (step B1). The user selects one of seven icons (selection buttons) on the user screen, for example, so that he can instruct execution of the process indicated by that icon. Which icon is selected is determined and detected in an input determination processing (steps B2 to B6).

When the user selects SEARCH OBJECT!, the determination of search objects is performed in step B7 through the input determination processing (step B2). In step B7, an information source already identified in the system is proposed to the user via the user interface unit 1, by the display shown in FIG. 6, for example. Information of the information source as shown in FIG. 6 is prestored in the data storage unit 3 as having the data structure shown in FIG. 7, for example.

Information of information sources stored in the data storage unit 3 is expressed as a data structure having various information sources designed in a hierarchical tree-structure. Leaf portions of the tree-structured data each correspond to various information sources accessible through the external interface unit 2. Branch portions of the tree-structured data are each a group of information, and each correspond to several information sources or information groups grouped together. Items reflecting the above tree structure are displayed on a screen for the aforementioned determination of search objects shown in FIG. 6. It is possible to determine a search object while displaying more detailed portions as needed. A user may designate an information group to specify multiple information sources included in that information group at a time.

A data storage area, which is specified by a leaf portion of the tree-structured data, is an area to hold various information corresponding to each information source. In this area the names of information sources, the types of the information sources (e.g., the types of various databases, and the types of information input lines), and methods of accessing the information sources (e.g., addresses, communication protocols, interactive protocols, etc.) are described. The information of any information source accessible through the external interface unit 2 can be acquired by referring to table information (i.e., the data storage area) which has a data structure with the mentioned area. As a result, the user can easily and efficiently collect information from available information sources. The user selects only the necessary information source as a search object on the screen for the determination of search objects shown in FIG. 6, and selects the END! button after the selection of the information source, terminating the determination of search objects. The information source specified by the determination of search objects is stored in the data area for a search object a shown in FIG. 3. For instance, when the user selects two information sources, "Japanese patent database" and "telex", the contents of the table information of the two information sources are stored in the data area for the search object a, and the proposition screen for the user returns to the initial screen shown in FIG. 5. When the user selects "newspaper", the contents of the table information of both information sources, A newspaper and B newspaper, which are a collection of newspaper information sources, are set in the data area for the search object a.
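The tree-structured source information and the expansion of a selected group into its member sources can be sketched as follows. This is only an illustration under stated assumptions: the `Source` record, the nested-dictionary tree, the `leaves` and `resolve` helpers, and every `kind` and `access` value are hypothetical placeholders rather than contents of FIG. 7.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Source:
    """Hypothetical leaf record: name, type, and access method of one source."""
    name: str
    kind: str     # e.g. "database" or "input line"
    access: str   # placeholder for address and protocol details


# Branches are dictionaries (information groups); leaves are Source records.
TREE = {
    "newspaper": {
        "A newspaper": Source("A newspaper", "database", "placeholder-access-a"),
        "B newspaper": Source("B newspaper", "database", "placeholder-access-b"),
    },
    "Japanese patent database": Source(
        "Japanese patent database", "database", "placeholder-access-patent"),
    "telex": Source("telex", "input line", "placeholder-access-telex"),
}


def leaves(node):
    """Collect every leaf Source beneath a node."""
    if isinstance(node, Source):
        return [node]
    result = []
    for child in node.values():
        result.extend(leaves(child))
    return result


def resolve(tree, selected):
    """Return the leaf sources for a selected item, whether it is a group or a leaf."""
    for name, child in tree.items():
        if name == selected:
            return leaves(child)
        if isinstance(child, dict):
            found = resolve(child, selected)
            if found:
                return found
    return []


print([s.name for s in resolve(TREE, "newspaper")])
```

Selecting the group "newspaper" yields both newspaper sources at once, which corresponds to the behavior described for the data area of the search object a.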

When the user selects SEARCH CONDITION!, the user's instruction is detected in the input discrimination process (step B3) and the search condition is determined (step B8). In the determination of search conditions, of the information source to be searched for (specifically, information to be obtained from "Japanese patent database" and "telex" determined above), user information for discriminating the information desired by the user from unnecessary information is determined. In the determination of search conditions in the system according to this embodiment, the search condition is specified and determined by two parameters, for example, a key word and a contents condition.

The determination of search conditions is performed by proposing a screen for the determination of search conditions as shown in FIG. 8 to the user. FIG. 8 exemplifies the case where the user has selected the search condition concerning a key word in the description of "natural language processing | voice recognition & syntactic analysis". The symbol "|" in the description of the search condition indicates an OR condition, while "&" means an AND condition. In the example shown in FIG. 8, therefore, two kinds of information are selected as search conditions: information with a key word, "natural language processing", or (OR) information having two key words, "voice recognition" and (AND) "syntactic analysis". Extension of key words and extension of the contents condition in FIG. 8 will be described later.
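The key word expression of FIG. 8 can be read as an OR of AND groups. The sketch below shows one way to evaluate such an expression against the key words attached to a piece of information. It assumes that "&" binds more tightly than "|", which is consistent with the reading given above, and that no parentheses occur; `parse_condition` and `matches` are hypothetical helper names.

```python
def parse_condition(expr):
    """Split 'a | b & c' into OR-groups of AND-terms: [['a'], ['b', 'c']]."""
    return [[term.strip() for term in group.split("&")]
            for group in expr.split("|")]


def matches(expr, keywords):
    """True if any OR-group has all of its AND-terms among the given key words."""
    present = set(keywords)
    return any(all(term in present for term in group)
               for group in parse_condition(expr))


CONDITION = "natural language processing | voice recognition & syntactic analysis"

print(parse_condition(CONDITION))
print(matches(CONDITION, ["voice recognition", "syntactic analysis"]))
```

Under this reading, information tagged only with "voice recognition" does not satisfy the condition, while information tagged with "natural language processing" alone does.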

In the determination of search conditions, a group of key words is formed from the individual key words determined above, and the determination is executed using that group as the object of the matching which will be described later.

FIG. 9 shows the process for producing the key word group from the individual key words. The process is performed in the following steps. A key word selected by a user is identified as an element of the key word group (step C1). The key word selected by the user and a key word synonymous with it are identified as elements of the key word group in the search condition portion of the user information unit (step C2). A key word of lower concept than the above-described key words, and a key word synonymous with it, are identified in the key word group (step C3).

The key word synonymical to the selected key word, and the key word of lower concept, which have been identified in the steps shown in FIG. 9, are stored in the data storage unit 3. The key words stored in the data storage unit 3 can be obtained referring to a word thesaurus as shown in FIG. 10. The word thesaurus indicates synonymical relationship between words by eq!, and describes the relationship between words by linking the upper and lower concepts of the words with is-a!.

In an example as illustrated in FIG. 10, sizengengosyori (Japanese)! is linked to natural language processing! with eq!. In step C2 in FIG. 9, therefore, "natural language processing" is identified in the key word group. Then, its synonymical key word and a key word of lower concept are acquired and identified in the key word group; for example, "kikaihonyaku (Japanese)" as "machine translation", and further "shitumon-outou sisutemu (Japanese)" as "question answering system", respectively, are identified in the key word group. In this manner, the key word selected by the user and a logical expression of key words originated from the selected key word are determined in the key word items.
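The expansion of a single key word into a key word group (steps C1 to C3) amounts to following synonym links in both directions and concept links downward only. The sketch below is a hedged illustration: the `EQ` and `IS_A` link tables are an assumed reading of the FIG. 10 example (in particular, which words are lower concepts of which is a guess), and `expand` is a hypothetical function name.

```python
from collections import deque

# Assumed synonym links (treated as symmetric).
EQ = [
    ("sizengengosyori", "natural language processing"),
    ("kikaihonyaku", "machine translation"),
    ("shitumon-outou sisutemu", "question answering system"),
]

# Assumed concept links, written as (lower concept, upper concept).
IS_A = [
    ("machine translation", "natural language processing"),
    ("question answering system", "natural language processing"),
]


def expand(keyword):
    """Return the key word group: the word, its synonyms, and all lower concepts."""
    group = [keyword]          # step C1: the selected key word itself
    seen = {keyword}
    queue = deque([keyword])
    while queue:
        word = queue.popleft()
        # step C2: synonyms in either direction
        related = [b for a, b in EQ if a == word] + [a for a, b in EQ if b == word]
        # step C3: lower concepts only (upper concepts are not followed)
        related += [lower for lower, upper in IS_A if upper == word]
        for candidate in related:
            if candidate not in seen:
                seen.add(candidate)
                group.append(candidate)
                queue.append(candidate)
    return group


print(expand("sizengengosyori"))
```

Because upper concepts are never followed, starting from "machine translation" does not pull in "natural language processing", which keeps the group from widening beyond what the user asked for.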

The key word specified by the user and key words originated from that key word are displayed as extended key words to the user as shown in FIG. 8. The user can delete or correct an unnecessary key word among the produced key words. As a possible modification of this processing, in generating the key word group from the user-specified key word, the key word thesaurus may be displayed as a list form, a tree-structured form or a network form, so that the user can select or add the necessary data, or delete the unnecessary data from the display.

A contents condition as a search condition is determined by describing conditions about the contents of information as a search object. Generally, the determination of contents conditions is performed by subjecting a conditional sentence written in a natural language to the analysis of form elements, syntactic analysis and meaning analysis, and converting the result of the analyses into a meaning matching template.

In the system of this embodiment, the contents condition is specified in a template form of

concerning . . . !

for example.

Specifically, when the user inputs "analysis of sentence" as the expression of the contents, determination of contents conditions is performed in the unit for determining contents condition.

FIG. 11 exemplifies the process of determining contents conditions. When the user inputs "analysis of sentence", consulting a dictionary and analysis into form elements are performed with respect to the given expression of the contents (step D1). A machine dictionary and rules for form-element analysis used in this process are prestored in the data storage unit 3. After the dictionary-consulting and form-element analysis are completed by the usual method, a train of words, "analysis (verbal noun) of (preposition) sentence (noun)", is acquired according to the analysis structure shown in FIG. 12.

Based on the result of the analysis done in step D1, analysis of the sentence is executed using the syntax grammar stored in the data storage unit 3 (step D2). Under the rules of the syntax grammar, the input expression of the contents is determined as a noun phrase as a whole, which has a word, "analysis", linked to a word, "sentence", with a preposition, "of".

In the meaning analysis process, it is apparent from the sentence structure acquired in step D2 that the word "analysis" is a main element of the sentence, and dominates the word "sentence" with a semantic relationship expressed by "of" (corresponding to a Japanese postpositional particle "no") (step D3). According to the acquired meaning structure, meaning matching templates corresponding to the meaning structure, such as "sentence analysis", "analyze sentence", "analyzed sentence", and "analysis of sentence", are produced as shown in FIG. 12 in step D4. The preparation of matching templates in this manner permits meaning matching with another expression simply by, for example, the user's specifying the description of the contents about the search condition for "analysis of sentence".

It is also possible to produce matching templates to another expression, such as "recognition of sentence (analysis of phrase)", by incorporating a matching process for a synonym and associated words shown in FIGS. 9 and 10. When the process in FIG. 11 is executed, a key word and a contents condition are determined as user information in the search condition section b of the user information unit in FIG. 3, and the screen returns to the initial screen in FIG. 5. Based on the search condition the user has input, the user can change (update) or delete the meaning templates produced through the process in FIG. 11. In the extension of the contents conditions in FIG. 8, an object to be matched with a meaning template corresponding to "analysis of sentence" is expressed in the form of a phrase. The user can pick up an expression appearing inadequate from the matching objects expressed in phrase form, and correct or delete it, for example.

Determination of proposition contents will now be explained.

When the user has selected PROPOSITION CONTENTS! on the initial screen in FIG. 5, the determination of proposition contents is invoked in step B9 through the input determination processing (step B4) in FIG. 4. FIG. 13 shows an example of a screen for the determination of proposition contents to be proposed to the user when the determination processing is invoked. Various available information sources individually generate information having various formats. In the determination of proposition contents in FIG. 13, basically, information from the information sources is classified into ranks by the importance of the contents. More specifically, patent information provided from the patent database includes "title", "inventor", "belonging to; applicant", "claims", "description of the invention; specification", and "drawings", and it is determined in the process shown in FIG. 13 which information contents should be proposed to the user.

For example, on the screen for the determination of proposition contents in FIG. 13, the importance according to the contents of information is given by the user to the elements of the patent information as "title; title of the invention", "company; applicant" and "name; inventor" in the first rank, "summary" and "claims" in the second rank, "drawings" in the third rank, and "description of the invention; specification" in the fourth rank.
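The ranking of patent elements can be illustrated with a small lookup table. The sketch below is a simplification under stated assumptions: the shortened element names, the `select_contents` function, and in particular the idea of proposing every element at or above a cutoff rank are assumptions made for illustration; the description only states that elements are classified into ranks by importance.

```python
# Ranks assigned by the user in the FIG. 13 example (1 is most important).
RANKS = {
    "title": 1,
    "applicant": 1,
    "inventor": 1,
    "summary": 2,
    "claims": 2,
    "drawings": 3,
    "specification": 4,
}


def select_contents(record, max_rank):
    """Keep only the elements whose rank is at or above the cutoff (assumed policy)."""
    return {name: value for name, value in record.items()
            if RANKS.get(name, float("inf")) <= max_rank}


# Placeholder record; the values are dummies, not real patent data.
record = {
    "title": "(title text)",
    "inventor": "(inventor name)",
    "claims": "(claims text)",
    "specification": "(full text)",
}

print(select_contents(record, 2))
```

Elements without an assigned rank are dropped in this sketch, which is one possible policy among several.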

The contents and formats of information obtained from the informationsources are determined by the types of the information sources in FIG.7. It is therefore possible to propose information about the types ofthe information sources to the user in order to guide the determinationof proposition contents. User information whose proposition contents aredetermined as described above is identified in the proposition contentsection c in the user information unit shown in FIG. 3.

When the user selects DETERMINATION OF PROPOSITION METHOD!, DETERMINATION OF DATA PROCEEDING!, or DETERMINATION OF DATA MODIFICATION! on the initial screen in FIG. 5, that selection is detected in the input determination process in step B5 in FIG. 4. The DETERMINATION! of proposition method, etc. is then invoked (step B10). FIG. 14 exemplifies a display screen in the case where any of the mentioned determination processes is executed. The user interactively determines four items, PERIOD!, PROPOSITION METHOD!, ERASE TIMING!, and DATA MODIFICATION!, through the screen in FIG. 14.

The item PERIOD! defines when the desired information should be proposed once it has been obtained through the above process. If "Monday, 10:00" has been selected, as shown in FIG. 14, data which specifies that collected information will be proposed to the user at ten o'clock every Monday is stored in the data storage unit 3.

To access an external database, the timing for accessing the external database has only to be selected in the same manner. In the example shown in FIG. 14, the database is set to be accessed at nine o'clock every Monday.

The item PROPOSITION METHOD! specifies means (proposition form) for providing information to the user. In the example in FIG. 14, that means is specified so that information proposition is performed using an electric mail EMAIL!. It is also possible to specify a report name defined by the user to propose a collection of information acquired from several user information units. For example, if EMAIL-WEEKLY! is specified for the PROPOSITION METHOD!, multiple pieces of information are packed in one mail before proposition to the user.

ERASE TIMING! is an item for specifying the timing for erasing data which has been fetched from an external unit and stored in the data storage unit. In FIG. 14, since information "one week after proposition" is set in the system, information which has been proposed to the user will be erased one week later.

DATA MODIFICATION! is user information for specifying the format of processing to be added to information obtainable from the external database. The user information is determined, for example, as "translation to English, output to paper" as shown in FIG. 14. In that case, collected information is translated into English, and the translated information is then printed on given paper. The user information to be determined as shown in FIG. 14 is stored in the individual items of the proposition method d, data proceeding e and data modification f. Through this step, the identification process in the user model determination unit 4 is completed.
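The four items determined through the screen in FIG. 14 can be pictured as one small record per user information unit. The following is a minimal sketch only; the class name `PropositionSettings`, its field names, the helper `is_due` and the "Day, HH:MM" string format are assumptions made for illustration and do not appear in the specification.

```python
from dataclasses import dataclass, field
from datetime import datetime

DAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
        "Friday", "Saturday", "Sunday"]


@dataclass
class PropositionSettings:
    """Hypothetical record for the items set on the screen in FIG. 14."""
    period: str                 # when collected information is proposed
    access_timing: str          # when the external database is accessed
    proposition_method: str     # e.g. "EMAIL"
    erase_timing: str           # e.g. "one week after proposition"
    data_modification: list = field(default_factory=list)


def is_due(period: str, now: datetime) -> bool:
    """Return True when `now` falls on the weekly 'Day, HH:MM' period."""
    day, clock = period.split(",")
    hour, minute = clock.strip().split(":")
    same_day = DAYS[now.weekday()] == day.strip()
    return same_day and (now.hour, now.minute) == (int(hour), int(minute))


# Values taken from the example described for FIG. 14.
settings = PropositionSettings(
    period="Monday, 10:00",
    access_timing="Monday, 09:00",
    proposition_method="EMAIL",
    erase_timing="one week after proposition",
    data_modification=["translation to English", "output to paper"],
)
```

Under this sketch, a scheduler would call `is_due(settings.period, now)` to decide whether proposition should take place and `is_due(settings.access_timing, now)` to decide whether the external database should be accessed.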

The user model edit processing AS in FIG. 2, which is for performing changing, addition and erasing on various data identified in the user model determination unit 4, is basically executed in the same manner as the identification processing.

The control unit 7 which performs the general control of the system according to this embodiment basically operates in accordance with the routine shown in FIGS. 15A and 15B.

First, the control unit 7 determines whether or not any information has been input through the external interface unit 2 (step E1). When there is external information, the control unit 7 checks if there is a user information unit which has the source of the input information determined as a search object (step E2). If such a user information unit exists, the externally-input data area and the user information unit having the source of the externally-input information as the search object are sent to the information drawing-out unit 5 where an information drawing-out process is executed (step E3). When there is no information input from the external interface unit 2, or no user information unit exists which has the information source of the externally-input information as a search object, the control unit 7 searches the user information units already identified, and checks if there is an information source accessible to an external database at the time of the searching (step E4). If there is an accessible information source, data to be accessed is fetched from the user information unit, and information accessing is performed through the external interface unit 2 in accordance with the accessing process stored in the data storage unit 3 to thereby acquire external information (step E5). When external information is input through steps E4 and E5, the processing starting with step E2 will be executed.

If there is no accessible information source at the time the determination is made in step E4, it is then determined whether or not there is a user information unit which should be proposed to the user (step E6). If such a user information unit exists, the control unit 7 activates the information proposition processing unit 6 to execute information proposition processing (step E7).

If there is no user information unit to be proposed to the user, or if there is no information proposition request from the user, the control unit 7 determines whether or not the user has made a user model determining or changing request (step E8). If there is a user model determining or changing request, the control unit 7 activates the user model determination unit 4 to perform the aforementioned user model determination (step E9).

Further, the control unit 7 searches the user information unit to determine if there presently exists information that should be erased (step E10). If such information to be erased exists, that information is erased from the data storage unit 3 (step E11).
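The order in which the control unit 7 tests its conditions in steps E1 through E10 can be summarized as a single decision function. This is a minimal sketch under stated assumptions: the function name `choose_action`, its boolean parameters and the returned labels are hypothetical, and the transitions after each action (for example, the return to step E2 after step E5) are not modelled.

```python
def choose_action(has_external_input: bool,
                  has_matching_unit: bool,
                  source_accessible: bool,
                  unit_due: bool,
                  user_model_request: bool) -> str:
    """Pick the next step following the order described for FIGS. 15A and 15B."""
    # E1/E2: external input exists and some user information unit
    # names its source as a search object.
    if has_external_input and has_matching_unit:
        return "E3: information drawing-out"
    # E4: an information source is accessible at this time.
    if source_accessible:
        return "E5: access external database"
    # E6: a user information unit should be proposed to the user.
    if unit_due:
        return "E7: information proposition"
    # E8: the user asked to determine or change the user model.
    if user_model_request:
        return "E9: user model determination"
    # E10: otherwise check whether stored information should be erased.
    return "E10: check erase timing"
```

For example, `choose_action(True, False, True, False, False)` selects step E5, because input whose source is not a search object of any user information unit falls through to the accessibility check in step E4.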

The processing of the information drawing-out unit 5 will now be described.

The information drawing-out unit 5 basically checks if information input through the external interface unit 2 matches with various predetermined conditions stored in the user information unit. When the input information matches with the conditions, it is stored and held in the data storage unit 3. Specifically, the information drawing-out unit 5 has a function to filter input information at a high speed and a function to analyze and determine the input information in more detail.

FIG. 16 illustrates the flow of the operation done by the information drawing-out unit 5.

The information drawing-out unit 5 refers to the type of the information source of information to be a search object, and determines whether or not a key word is added to the information given from the information source (step F1).

If key word information is affixed to the given information, a key word group acquirable from the given information is subjected to matching with a key word condition in the search conditions in the determined user information unit (step F2). The matching process in step F2 is realized by determination of a logical operation. When the matching with the key word fails in step F2, it is determined that the input information is not what is desired by the user, and the matching process will be terminated (step F3).

When no key word is affixed to the input information (step F1), a matching process between individual words constituting a text portion in the given input information and the key word condition indicated by the search conditions in the user information unit is executed over the entire text (step F4). When the matching with the key word fails in step F4, it is determined that the input information is not what is desired by the user, and the key word matching process will be terminated (step F5).

When the key word matching is successful in step F2 or F4, it is determined that the input information is what the user wants, and the text portion in the input information for which the matching process has succeeded is drawn out (step F6). Thereafter, texts are drawn out one by one from the drawn-out text portion and the contents of each drawn-out text are checked.

Then, it is determined if a contents condition is specified in the search conditions in the user information unit (step F7). When the contents condition of the search condition in the user information unit is specified, it is determined if the processing has progressed to the end of the text portion (step F8). If the processing has progressed to the end of the text portion in step F8, the processing will be terminated. Otherwise, one text is drawn out from the text portion of the input information, and this text is subjected to a form-element analysis, syntactic analysis and meaning analysis to acquire the semantic structure of the drawn-out text (step F9).

Then, the text's semantic structure resulting from the analyzing process is subjected to meaning matching with the contents condition indicated by the user information unit (step F10). Then, the result of the matching in step F10 is determined (step F11). When the meaning matching is successful in step F11, the input information is determined as the one the user wants. An identifier of the user information unit is affixed to the determined information, and the resultant information is stored in the data storage unit 3 (step F12).

Through the above procedures, only the information matching with the user information unit can be accumulated in the data storage unit 3 and can be collected.
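The two-stage filtering of steps F1 through F12 can be sketched as follows. This is an illustration under stated assumptions, not the embodiment itself: the dictionary layout of `item` and `unit`, the function name `draw_out`, the treatment of the key word condition as a plain AND of key words, the substring test used for unkeyed text, and the replacement of the form-element, syntactic and meaning analysis of step F9 by a caller-supplied predicate are all hypothetical simplifications.

```python
def draw_out(item: dict, unit: dict):
    """Return a stored record if `item` satisfies `unit`, else None."""
    wanted = [k.lower() for k in unit["keywords"]]
    attached = item.get("keywords")

    if attached is not None:                       # F1 -> F2
        pool = {k.lower() for k in attached}
        keyword_ok = all(k in pool for k in wanted)
    else:                                          # F1 -> F4
        text = item["text"].lower()
        keyword_ok = all(k in text for k in wanted)

    if not keyword_ok:                             # F3 / F5
        return None

    contents = unit.get("contents")                # F7
    if contents is None:
        # No contents condition: the key word match alone decides.
        return {"unit_id": unit["id"], "item": item}

    for sentence in item["text"].split("."):       # F8 / F9 (one text at a time)
        if sentence.strip() and contents(sentence):    # F10 / F11
            return {"unit_id": unit["id"], "item": item}   # F12
    return None


# Telex example: no attached key words, no contents condition.
unit2 = {"id": 2, "keywords": ["public works"], "contents": None}
telex_a = {"text": "The budget for the public works was increased."}
telex_b = {"text": "Exchange rates were unchanged."}
```

With these inputs `draw_out(telex_a, unit2)` yields a record tagged with the identifier 2, while `draw_out(telex_b, unit2)` yields `None`, mirroring the telex example given later in the description.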

The meaning matching process will now be described, giving a specific example.

Assume that a meaning matching template as shown in FIG. 12 is stored in the contents condition section in the user information unit. When a sentence

This system analyzes simple sentences.! is included in the text in the input information which is to be subjected to the matching process, the result of the analysis of the semantic structure for this sentence would be as shown in FIG. 17.

When each portion of this semantic structure is subjected to template matching for the aforementioned meaning matching template, matching succeeds at the portion

analyze-object-sentence!.

Because of the successful matching process, in this example, it is determined that the user-specified condition "analysis of sentence" has the same meaning as a text "analyze sentence".
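The template matching in this example can be pictured by treating the semantic structure as a set of (predicate, relation, argument) triples and the meaning template as one such triple. The triple representation, the names `Triple` and `matches_template`, and the particular triples listed for the sample sentence are assumptions made for illustration; the actual structures are those of FIGS. 12 and 17, which are not reproduced here.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]


def matches_template(structure: List[Triple], template: Triple) -> bool:
    """Succeed when any portion of the structure equals the template."""
    return any(portion == template for portion in structure)


# Hypothetical analysis result for "This system analyzes simple sentences."
structure = [
    ("analyze", "agent", "system"),
    ("analyze", "object", "sentence"),
    ("sentence", "modifier", "simple"),
]

# Hypothetical template derived from the condition "analysis of sentence".
template = ("analyze", "object", "sentence")
```

Here `matches_template(structure, template)` is true because the second portion coincides with the template. A fuller version would also expand the template with the synonyms and associated words of FIGS. 9 and 10 before comparing.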

The operation of the information proposition processing unit 6 will now be described.

The information proposition processing unit 6 is invoked when the control unit 7 determines in step E6 in FIG. 15B that there is a user information unit to be proposed, or when the user requests information proposition.

FIGS. 18A and 18B exemplify the sequence of processes of the information proposition processing unit 6. When the information proposition processing unit 6 is invoked, multiple pieces of information stored by the information drawing-out unit 5 are stored in the data storage unit 3 together with identifiers of the corresponding user information units and information concerning a data-collecting period. The information proposition processing unit 6 first draws out all information which is currently a proposition object from the information in the data storage unit 3, and determines a proposition information area for preparing information to be proposed to the user (step G1).

The input information and the user information unit are stored in the data storage unit 3 as shown in FIG. 19, for example. Referring to FIG. 19, x is the identifier of the user information unit, y is information determined through the external interface unit 2 by the information drawing-out unit 5, z is information of the time at which the system has collected information, and w is pointer information (capable of storing more than one piece of information) for information which is fetched from the user information unit based on that user information unit. WEEK-EMAIL! and MONTH-EMAIL! are used as a method for proposing information in the user information unit x.

The user information relating to the above proposition method is a title for proposing multiple pieces of information as one, and is linked as information represented individually in FIGS. 20A and 20B. It is determined in the example in FIG. 20A that the proposition timing of WEEK-EMAIL! is "ten o'clock every Monday", the total amount of information to be proposed is "two A4-size pages of information", and the proposition method is "propose to USER 1 by electric mail".

In the process in step G1 in FIG. 18A, referring to the information, the information proposition processing unit 6 links information to be proposed, and draws it out. As a result, "information 1" and "information 3", both stored in the user information units having WEEK-EMAIL! as the proposition method, will be set as proposition objects in the proposition information area on Monday, at ten o'clock. After setting the information in the proposition information area, information about the proposition method is read out from the data storage unit 3 (step G2). Through the process in step G2, with regard to the WEEK-EMAIL!, user information to the effect that the total amount of information is two A4-size pages of information and the information is to be proposed to the USER1 by electric mail will be fetched out.

In the case of user information which, unlike the mentioned WEEK-EMAIL!, does not exist over multiple user information units, proposition information specified by an individual user information unit is fetched out.

In step G1, pointer information (see FIG. 19) to information present in the data storage unit 3 is set in the proposition information area, so that information (unsummarized, etc.) accumulated in the data storage unit 3 can be referred to using the pointer information.

After information is read out from the data storage unit 3 in the above manner, it is determined if the total amount of information set in the proposition information area exceeds the amount of information specified by the user (step G3). When the amount of information set in the proposition information area exceeds the user-specified amount of information, the information in the proposition contents areas of the user information units for the respective information is checked and it is then determined whether or not information of a lower rank than the information set in the proposition information area can be erased (step G4).

When erasing the information of a lower rank is determined to be possible in step G4, this information is erased from the proposition information area to reduce the total amount of information in the proposition information area (step G5). Thereafter, the sequence of processes starting with step G3 will be repeatedly executed. When it is not possible to erase information of a lower rank, it is checked if the information in the proposition information area has already been summarized (step G6). If a summarizing process has not been performed yet, summarization of that information is done to reduce the information in the proposition information area (step G7), before the process sequence from step G3 is repeated.
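The loop formed by steps G3 through G7 can be sketched as below. Several points are assumptions made only for illustration: the amount of information is measured as a character count rather than A4 pages, each entry of the proposition information area is a dictionary with hypothetical `rank` and `text` keys, the first rank is treated as never erasable, and `summarize` is any caller-supplied function standing in for the process of FIG. 21.

```python
def reduce_area(area: list, limit: int, summarize) -> list:
    """Shrink the proposition information area toward `limit` characters."""
    area = [dict(entry) for entry in area]     # work on a copy
    summarized = False
    while sum(len(e["text"]) for e in area) > limit:          # G3
        erasable = [e for e in area if e["rank"] > 1]         # G4
        if erasable:
            lowest = max(e["rank"] for e in erasable)
            area = [e for e in area if e["rank"] != lowest]   # G5
        elif not summarized:                                  # G6
            for e in area:
                e["text"] = summarize(e["text"])              # G7
            summarized = True
        else:
            break      # nothing more can be reduced
    return area
```

For instance, with entries of rank 1 ("title"), rank 2 (a summary) and rank 3 (a long body), a limit that the first two ranks fit within causes only the rank 3 entry to be dropped, which corresponds to the elimination of "text" described for FIG. 25.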

The summarization will be briefly explained below. The process of summarizing information progresses in accordance with the process flow shown in FIG. 21, for example. This summarization starts with the extraction of one piece of information from the proposition information area (step H1). Then, it is determined whether or not summary information is affixed to the extracted information (step H2).

The determination process is carried out by searching for a key word, such as "summary", "abstract" or "first" and drawing out the text portion from the searched key word to the next chapter or next passage or paragraph. In this process, when it is determined that summary information is affixed to the information to be proposed, text information excluding the summary portion in the information is erased, and the remaining portion is set in the proposition information area (step H3). When the drawn-out information has not been determined as having summary information affixed thereto, it is then determined whether or not that information has "title", "chapter head", or the like (step H4). Generally, in step H4, a chapter head has a structure of "numeral + a train of characters", such as "1. Process flow". Therefore, it is possible to determine and detect a chapter head by drawing out a pattern of characters, such as the aforementioned chapter head. When a sentence having "title", "chapter head" or the like is detected and determined, text information other than the "title" or "chapter head" in that sentence is erased, and the remaining information is set in the proposition information area before the process is terminated (step H5).

When "title", "chapter head" or the like is not detected, a middle portion excluding the first and last portions of the information is replaced with an ellipsis ". . ." and the amount of information the ellipsis replaces is set as deleted information in the proposition information area (step H6). The first portion and last portion determined in step H6 may be determined as the first and last paragraphs in the text information, or a given number of characters respectively from the first and last characters in the text information. Further, which portion to extract may be changed in accordance with the amount of information. The summarization of information is executed through the above-described processing to reduce the amount of information in the proposition information area, so that the total amount of information can fall within the desired range.
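The three fallbacks of steps H2 through H6 can be sketched as a single function. The following is a simplified illustration: the function name `summarize`, the `keep` parameter, the rule that a summary is a blank-line-separated paragraph beginning with "summary" or "abstract", and the regular expression used for chapter heads of the form "numeral + a train of characters" are all assumptions, and the bookkeeping of the deleted amount in step H6 is omitted.

```python
import re

# A chapter head such as "1. Process flow" on a line of its own.
CHAPTER_HEAD = re.compile(r"^\d+\.[ \t]+\S.*$", re.MULTILINE)


def summarize(text: str, keep: int = 40) -> str:
    """Reduce `text` using the order of checks described for FIG. 21."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]

    # H2/H3: keep only an attached summary portion, if one exists.
    for paragraph in paragraphs:
        if paragraph.lower().startswith(("summary", "abstract")):
            return paragraph

    # H4/H5: otherwise keep only the chapter heads.
    heads = CHAPTER_HEAD.findall(text)
    if heads:
        return "\n".join(heads)

    # H6: otherwise keep the first and last portions around an ellipsis.
    if len(text) <= 2 * keep:
        return text
    return text[:keep] + " . . . " + text[-keep:]
```

In this sketch the first and last portions of step H6 are a fixed number of characters; taking the first and last paragraphs instead, as the description also permits, would only change the final branch.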

When the total amount of information in the proposition information area falls within the user-specified limit, or when it is not possible to reduce the information further, it is determined whether or not the user information unit designates data modifications to that information (step G8). When information about a data modification is identified in the user information unit by the user, the designated data modification is performed (step G9). For instance, when data modification, "translation into English", is specified, a Japanese sentence is machine-translated into English, and the result of the translation is set in the proposition information area. Then, it is determined whether or not EMAIL! is specified as an information proposition method (step G10). When EMAIL! is specified, the contents of information in the proposition information area are transmitted via electric mail to the specified user, and the proposition process is terminated (step G11). When EMAIL! is not specified, it is determined if the specified proposition method is "output to paper" (step G12). When "output to paper" is specified, the contents of the proposition information area are output on an output unit, such as an LBP, via the user interface unit 1 to propose it to the user (step G13).

It is also determined in step G14 if the specified proposition method is "output to file". When "output to file" is specified, the contents of the proposition information area are output to a specified file (step G15). It is also determined in step G16 if the specified proposition method is "display in real time". When "display in real time" is specified, it is checked if the real-time display of the information to the user is possible (step G17).

The determination of the possibility of the real-time display in step G17 is performed by checking if the user is using a computer system at the time of information proposition. If such real-time display is possible, the contents of the proposition information area are proposed through the user interface unit 1 using the screen the user is using (step G18).

When none of the proposition methods mentioned above is specified, information in the proposition information area is proposed to the user utilizing other proposing means (step G19).
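The chain of tests in steps G10 through G19 amounts to selecting an output channel from the specified proposition method. The sketch below is illustrative only: the function name `choose_output`, the returned channel labels and the `user_online` flag are hypothetical, and sending a "display in real time" request to the other proposing means when the user is not at a computer is an assumption, since the description does not state what happens in that case.

```python
def choose_output(method: str, user_online: bool = False) -> str:
    """Map a proposition method to an output channel (steps G10 to G19)."""
    if method == "EMAIL":                      # G10 -> G11
        return "electric mail"
    if method == "output to paper":            # G12 -> G13
        return "printer"
    if method == "output to file":             # G14 -> G15
        return "file"
    if method == "display in real time" and user_online:   # G16, G17 -> G18
        return "screen"
    return "other proposing means"             # G19
```

Under this sketch `choose_output("display in real time", user_online=True)` selects the screen the user is using, whereas an unrecognized method always falls through to step G19.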

A specific example of using the information collection system having the above structure will be described below.

Consider the case where two user information units as shown in FIGS. 22A and 22B, for example, are determined using the user model determination unit 4.

In the user information unit #1 in FIG. 22A, "Japanese patent database" is specified as a search object, while the same search condition as shown in FIG. 8 is specified for the search object. Proposition contents are specified with "title", "company" and "name" as the highest rank, "summary" as the second rank, and "text" as the third rank. WEEK-EMAIL! is specified as the proposition method. The proposition method is specified in this example as the one shown in FIG. 20, namely "timing: Monday 10 o'clock, total amount: two pages in A4 size, method: EMAIL TO USER1".

The user information unit #2 in FIG. 22B specifies a "telex" line as a search object and "public works" as a key word that is a search condition. Further, WEEK-EMAIL! is specified as the proposition method for the user information unit #2 as in the case of the user information unit #1.

When the user information unit #1 or #2 is identified in the data storage unit 3 of the system according to this embodiment, the control unit 7 starts information collection through the external interface unit 2. At the beginning of the information collection, the control unit 7 detects that the reporting timing (timing for information proposition) for WEEK-EMAIL! is 10 o'clock on Monday, and finds out that the type of the "Japanese patent database" as the search object of the user information unit #1 is "database" referring to the table shown in FIG. 7. The control unit 7 therefore accesses the "Japanese patent database" at 10 o'clock on Monday. Through the data access, patent data which has been registered in the "Japanese patent database" since the last Monday, 10 o'clock will be drawn out.

When patent data as shown in FIG. 23 is acquired by the above data access, the control unit 7 sends the acquired patent data to the information drawing-out unit 5. Based on the search condition indicated in the user information unit #1, the information drawing-out unit 5 determines if matching with information data is needed. In this example, for patent A, a key word condition "voice recognition & sentence analysis" is matched with the determined user information unit, and the text portion "syntactic analyzing process" (object) present in the text is semantically matched with the contents condition of the user information unit #1, "analysis of sentence". As a result, patent A is determined as information which is matched with the user's request in accordance with the search condition of the user information unit #1, and it is stored together with the identifier of the user information unit #1 into the data storage unit 3.

With regard to patent B in the patent data shown in FIG. 23, the "key word condition" and the "contents condition" are likewise matched, so that the information of patent B is stored together with the identifier of the user information unit #1 into the data storage unit 3.

Since patent C shown in FIG. 23 does not match with the "key word condition", it will be excluded from what should be stored in the data storage unit 3. That is, the information of patent C is not stored in the data storage unit 3. In the above manner, of patents A, B and C obtainable from the "Japanese patent database", only the first two, patents A and B, are stored in the data storage unit 3 as information to be proposed to the user.

Information collection for the user information unit #2 will be executed as follows. When a telex as shown in FIG. 24 is sent over a telex line, for example, the control unit 7 detects information input through the external interface unit 2 and activates the information drawing-out unit 5 in accordance with the process in step E1 shown in FIG. 15A. When the input information is telex information, a key word is not added to the telex information itself. Matching between a word in the telex text and a key word "public works" is performed.

Since the text in a telex A in FIG. 24 includes a text portion saying "the budget for the public works was increased", the key word condition indicated by the user information unit #2 is fulfilled. As no particular contents condition is specified in the user information unit #2, the telex A is directly determined as information the user wants. As a result, the information of the telex A is stored together with the identifier of the user information unit #2 into the data storage unit 3.

Since a telex B shown in FIG. 24 does not contain the words "public works", it fails to fulfill the key word condition. The telex B is therefore not stored in the data storage unit 3.

In the above state for information collection, the control unit 7 activates the information proposition processing unit 6 to perform the proposition of WEEK-EMAIL!.

FIG. 25 illustrates an example of proposition information prepared by the information proposition processing unit 6. The proposition information shown in FIG. 25 is information acquired from the "Japanese patent database" as information about an information source. Although there are five items, "title", "company", "name", "summary" and "text", specified as the proposition contents of the user information unit #1, the total amount of all the information in the "Japanese patent database" exceeds the total amount specified by the aforementioned WEEK-EMAIL!. FIG. 25 therefore illustrates the information as proposed with information of "text" of a lower rank eliminated by the information proposition processing unit 6. Information corresponding to the user information unit #2 is proposed in such a way that the middle portion of the telex information text is omitted and the resultant information is incorporated into the aforementioned database information.

In FIG. 25, "%105", "%109" and "%15" are pointers to the original information stored in the lower concept in step G1 in FIG. 18A. These pointers permit access to the original information stored in the lower concept. Accordingly, the user can request proposition of the original information using such a pointer as a key. With the use of hypermedia, the pointers for data access can be realized as pointers on the proposition screen, in which case the user can easily access the original information by clicking the desired button on the proposition screen.

According to the information collection system embodying the present invention, it is possible to automatically collect information specified by a user, and propose the information at the user-desired timing in the user-desired form. This produces practically significant effects, such as the desired information being collectible accurately and in a reliably arranged form without any information being overlooked.

The present invention is not limited to the above-described embodiment. For instance, while the system according to this embodiment has been described referring to the case where access to a database and data input from a telex line are performed, this invention may be applied to access to various information sources, such as other network news, personal computer communication, and computer electric mail.

While data to be handled in the system of this embodiment has been described as character data, this system may be applied to various forms of information, such as a chart or diagram, a picture image, speech, and numerical data, in accordance with how the data of that information source is output. With regard to proposition of information to a user, it is not limited to electric mail or real-time display on a screen, but a computer-controllable communication medium, such as access to the user through a telephone, may be properly used as needed.

Although the data modifying process is exemplified as translation of information in the system of this embodiment, various other operations are also possible, such as acquisition of a history about information or statistical information, automatically outputting a command to the outside of the system and sending information via electric mail to other users. While the summarization is performed on all the information to reduce the amount of information, it is possible to take some measures to further greatly reduce the amount of data whose validity drops with the passage of time. The present invention may be modified in various other ways within the scope and spirit of the invention.

Additional advantages and modifications will readily occur to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details, representative devices, and illustrated examples shown and described herein. Accordingly, various modifications may be made without departing from the spirit or scope of the general inventive concept as defined by the appended claims and their equivalents.

What is claimed is:
1. An information collection system connected to a communication network for receiving a plurality of information and for collecting desired information from said communication network, comprising: user interface means for executing input/output information with respect to a user; external interface means for exchanging various types of information with a plurality of external units connected to said communication network; user model determination means for preparing first user-information for discriminating and modifying information required by said user, and second user-information for defining an information proposition method in a form of a user-desired format; information drawing-out means for drawing out information input from said external interface means in accordance with said first user-information; information modify means for modifying the drawn out information into a user-desired information by performing desired processing in accordance with the first user-information; information proposition processing means for converting information modified by said information modify means into a proposition form for said user in accordance with said second user-information; output means for outputting information converted by said information proposition processing means; and control means for receiving information from said individual means mentioned above, and executing control of said individual means and information exchange therewith in accordance with said received information, wherein said information modify means includes information translating means for analyzing a sentence and meaning of the drawn out information and translating the drawn out information from a first language to a second language in accordance with the first user-information.
2. An information collection system connected to a communication network for receiving a plurality of information for collecting desired information from said communication network, comprising: user interface means for executing input/output information with respect to a user; external interface means for exchanging various types of information with a plurality of external units connected to said communication network; user model determination means for preparing first user-information for discriminating and modifying information required by said user, and second user-information for defining an information proposition method in a form of a user-desired format; information drawing-out means for drawing out information input from said external interface means in accordance with said first user-information; information modify means for modifying the drawn out information into a user-desired information by performing desired processing in accordance with the first user-information; information proposition processing means for converting information modified by said information modify means into a proposition form for said user in accordance with said second user-information; and output means for outputting information converted by said information proposition processing means, wherein said information modify means includes information translating means for analyzing a sentence and meaning of the drawn out information and translating the drawn out information from a first language to a second language in accordance with the first user-information.
3. In an information collection system connected to a communication network for receiving a plurality of information for collecting desired information from said communication network, an information collection method comprising the steps of: executing input/output information with respect to a user; exchanging various types of information with a plurality of external units connected to said communication network; preparing first user-information for discriminating and modifying information required by said user, and second user-information for defining an information proposition method in a form of a user-desired format; drawing out input information in accordance with said first user-information; modifying the drawn out information into a user-desired information by performing desired processing in accordance with the first user-information; converting information modified by said information modify means into a proposition form for said user in accordance with said second user-information; outputting converted information; and receiving information from said individual steps mentioned above, and executing control of said individual steps and information exchange therewith in accordance with said received information, wherein said modifying step includes substeps of analyzing a sentence and meaning of the drawn out information, and translating the drawn out information from a first language to a second language in accordance with the first user-information.
4. In an information collection system connected to a communication network for receiving a plurality of information for collecting desired information from said communication network, an information collection method comprising the steps of: executing input/output information with respect to a user; exchanging various types of information with a plurality of external units connected to said communication network; preparing first user-information for discriminating and modifying information required by said user, and second user-information for defining an information proposition method in a form of a user-desired format; drawing out input information in accordance with said first user-information; modifying the drawn out information into a user-desired information by performing desired processing in accordance with the first user-information; converting information modified by said information modify means into a proposition form for said user in accordance with said second user-information; and outputting converted information, wherein said modifying step includes substeps of analyzing a sentence and meaning of the drawn out information, and translating the drawn out information from a first language to a second language in accordance with the first user-information.